Interpolating Conditional Density Trees

نویسندگان

  • Scott Davies
  • Andrew W. Moore
چکیده

Joint distributions over many variables are frequently modeled by decomposing them into products of simpler, lower-dimensional conditional distributions, such as in sparsely connected Bayesian networks. However, au­ tomatically learning such models can be very computationally expensive when there are many datapoints and many continuous vari­ ables with complex nonlinear relationships, particularly when no good ways of decom­ posing the joint distribution are known a pri­ ori. In such situations, previous research has generally focused on the use of discretization techniques in which each continuous vari­ able has a single discretization that is used throughout the entire network. In this paper, we present and compare a wide variety of tree-based algorithms for learning and evaluating conditional density estimates over continuous variables. These trees can be thought of as discretizations that vary ac­ cording to the particular interactions being modeled; however, the density within a given leaf of the tree need not be assumed con­ stant, and we show that such nonuniform leaf densities lead to more accurate density esti­ mation. We have developed Bayesian net­ work structure-learning algorithms that em­ ploy these tree-based conditional density rep­ resentations, and we show that they can be used to practically learn complex joint prob­ ability models over dozens of continuous vari­ ables from thousands of data points. We focus on finding models that are simultaneously ac­ curate, fast to learn, and fast to evaluate once they are learned.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

تأثیر الگوی پراکنش درختان بر برآورد تراکم با روش نمونه برداری نزدیک‌ترین فرد: مطالعات موردی در درختزارهای بنه زاگرس و توده‌های شبیه سازی شده

Distance methods and their estimators of density may have biased measurements unless the studied stand of trees has a random spatial pattern. This study aimed at assessing the effect of spatial arrangement of wild pistachio trees on the results of density estimation by using the nearest individual sampling method in Zagros woodlands, Iran, and applying a correction factor based on the spatial p...

متن کامل

ON THE STATIONARY PROBABILITY DENSITY FUNCTION OF BILINEAR TIME SERIES MODELS: A NUMERICAL APPROACH

In this paper, we show that the Chapman-Kolmogorov formula could be used as a recursive formula for computing the m-step-ahead conditional density of a Markov bilinear model. The stationary marginal probability density function of the model may be approximated by the m-step-ahead conditional density for sufficiently large m.

متن کامل

Tree Representations and the Usability of Interpolating Controllers

Interpolation is ubiquitous as a user interface method: it is here investigated for its usability using both numerical and experimental methods. Three properties—consistency, locality, and monotonicity—are stated to be required for interpolating controllers to be usable. These properties are defined quantitatively. An existing linear representation for tree structures is shown to have relativel...

متن کامل

Bagging classifiers based on Kernel density estimators

A lot of research is being conducted on combining classification rules (classifiers) to produce a single one, known as an ensemble, which in general is more accurate than the individual classifiers making up the ensemble. Two popular methods for creating ensembles are Bagging introduced by Breiman, (1996) and, AdaBoosting by Freund and Schapire (1996). These methods rely on resampling technique...

متن کامل

Recursive partitioning and Bayesian inference on conditional distributions

In this work we introduce a Bayesian framework for nonparametric inference on conditional distributions in the form of a prior called the conditional optional Pólya tree. The prior is constructed based on a two-stage nested procedure, which in the first stage recursively partitions the predictor space, and then in the second generates the conditional distribution on those predictor blocks using...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002